A Two-staged Clustering Algorithm for Multiple Scales

Authors

  • Chien-Lung Chan
  • Rung-Ting Chien
Abstract

Cluster analysis is a data mining technique used to identify hidden patterns within data. Most clustering algorithms weight all data fields equally and compute the “distance” between records with a single method. They ignore the fact that different fields are measured on different scales, so the “distance” should be calculated differently for each. This study combines a traditional clustering algorithm with expert subjective judgment and uses a different method to calculate the degree of similarity for each of four scales: nominal, ordinal, interval, and ratio. It proposes a two-staged clustering algorithm. In the first stage, training data are used to determine the parameters that improve clustering quality. In the second stage, the degree of similarity is calculated with a scale-specific method for each field, and the fields are given unequal weights. The proposed method was evaluated on four standard data sets: the Wisconsin Breast Cancer Data, Contraceptive Method Choice Data, Iris Education Data, and Balance Scale Weight & Distance Data. The results were positive: the multi-scale algorithm produced better-quality clusters, and the variant incorporating expert subjective weighting achieved better clustering accuracy.
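
The abstract does not give the similarity formulas, so the following is only a minimal sketch of the general idea: score each field with a method suited to its scale, then combine the scores with unequal weights. The per-scale rules here are a Gower-style assumption and not the authors' method. The names `field_similarity` and `weighted_similarity`, the range arguments, and the weights are all hypothetical; in the study the weights would come from expert judgment and the first-stage training, neither of which is reproduced here.

```python
# Hypothetical scale labels; the abstract names the four scales
# but does not say how each one is scored.
NOMINAL, ORDINAL, INTERVAL, RATIO = "nominal", "ordinal", "interval", "ratio"


def field_similarity(a, b, scale, value_range=None):
    """Return a similarity in [0, 1] for one field (assumed rules)."""
    if scale == NOMINAL:
        # Categories either match or they do not.
        return 1.0 if a == b else 0.0
    if scale in (ORDINAL, INTERVAL):
        # Ranks or values: absolute difference normalised by the field range.
        lo, hi = value_range
        span = hi - lo
        return 1.0 if span == 0 else 1.0 - abs(a - b) / span
    if scale == RATIO:
        # A true zero makes the quotient meaningful; assumes values >= 0.
        if a == b:
            return 1.0
        return min(a, b) / max(a, b)
    raise ValueError(f"unknown scale: {scale!r}")


def weighted_similarity(x, y, scales, ranges, weights):
    """Weighted mean of the per-field similarities of records x and y."""
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one weight must be non-zero")
    score = 0.0
    for a, b, scale, value_range, w in zip(x, y, scales, ranges, weights):
        score += w * field_similarity(a, b, scale, value_range)
    return score / total
```

Called on two records with one field of each scale, `weighted_similarity` returns a single score in [0, 1] that a standard clustering routine could use in place of a plain Euclidean distance. Raising the weight of a field makes agreement on that field count for more, which is the role the abstract assigns to expert subjective weighting.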

Similar resources

Identification of miR-24 and miR-137 as novel candidate multiple sclerosis miRNA biomarkers using multi-staged data analysis protocol

Many studies have investigated misregulation of miRNAs relevant to multiple sclerosis (MS) pathogenesis. Abnormal miRNAs can be used both as candidate biomarker for MS diagnosis and understanding the disease miRNA-mRNA regulatory network. In this comprehensive study, misregulated miRNAs related to MS were collected from existing literature, databases and via in silico prediction. A multi-staged...

Stability Analysis of a Strongly Displacement Time-Delayed Duffing Oscillator Using Multiple Scales Homotopy Perturbation Method

In the present study, some perturbation methods are applied to Duffing equations having a displacement time-delayed variable to study the stability of such systems. Two approaches are considered to analyze Duffing oscillator having a strong delayed variable. The homotopy perturbation method is applied through the frequency analysis and nonlinear frequency is formulated as a function of all the ...

Application of the method of multiple scales for nonlinear vibration analysis of mechanical systems with dry and lubricated clearance joints

In this study, the method of multiple scales is used to perform a nonlinear vibration analysis of a mechanical system in two cases; with dry and lubricated clearance joints. In the dry contact case, the Lankarani-Nikravesh model is used to represent the contact force between the joined bodies. The surface elasticity is modeled as a nonlinear spring-damper element. Primary resonance is discussed...

Bilateral Staged Total Hip Replacement and the Natural Progress of an Untreated Case of Developmental Dysplasia (Dislocation) of the Hip: A Clinical Case Report by the Surgeon and the Patient

The natural history of an untreated case of a Developmental Dysplasia (Dislocation) of the Hip (DDH) associated with multiple congenital abnormalities is reported in a 55-years-old man. The patient’s complaints and the varieties of the typical manifestations emerged in other parts of the body throughout the life are reviewed and discussed as comorbidities of a dysplastic condition. Two-stage bi...

Cytogenetics and Revised International Staging System (R-ISS): Risk Stratification in Multiple myeloma - A Retrospective Study in Indian Population

Background & Objective: Cytogenetic abnormalities in multiple myeloma (MM) have emerged as the most important factor determining prognosis and survival. Fluorescence in situ hybridization (FISH) can detect a greater number of cytogenetic abnormalities than conventional karyotyping and has therefore become the standard test for determining genetic abnormalities in MM....


Journal:
  • IJEBM

Volume 3

Publication date: 2005